Abstract

Spectrum sensing is a fundamental and critical issue for opportunistic spectrum access in cognitive 
radio networks. Among the many spectrum sensing methods, the information theoretic criteria (ITC) 
based method is a promising blind method which can reliably detect the primary users while requiring 
little prior information. In this paper, we provide an in-depth treatment of the ITC sensing method.
To this end, we first introduce a new over-determined channel model constructed by applying multiple
antennas or over-sampling at the secondary user in order to make the ITC applicable. Then, a simplified
ITC sensing algorithm is introduced, which needs to compute and compare only two decision values. 
Compared with the original ITC sensing algorithm, the simplified algorithm significantly reduces the 
computational complexity without losing any performance. Applying the recent advances in random 
matrix theory, we then derive closed-form expressions to tightly approximate both the probability of 
false alarm and probability of detection. Based on the insight derived from the analytical study, we 
further present a generalized ITC sensing algorithm which can provide flexible tradeoff between the 
probability of detection and probability of false alarm. Finally, comprehensive simulations are carried out 
to evaluate the performance of the proposed ITC sensing algorithms. Results show that they considerably 
outperform other blind spectrum sensing methods in certain cases. 

Index Terms 

Cognitive radio networks, spectrum sensing, information theoretic criteria, random matrix theory. 

I. Introduction 

Due to the increasing popularity of wireless devices in recent years, the radio spectrum has
become an extremely scarce resource. By contrast, 90 percent of the existing licensed spectrum
remains idle, and the usage varies geographically and temporally, as reported by the Federal
Communications Commission (FCC) [1]. This indicates that the fixed frequency regulation policy
conflicts drastically with the high demand for frequency resources. Cognitive radio (CR) is one
of the most promising technologies for dealing with such an irrational frequency regulation policy [2],
[3] and has received a lot of attention. In cognitive radio networks, secondary (unlicensed) users
first reliably sense the primary (licensed) channel and then opportunistically access it without
causing harmful interference to primary users [4]. By doing this, the spectrum utilization of
existing wireless communication networks can be tremendously improved. The FCC has issued a
Notice of Proposed Rule Making to allow unlicensed CR devices to operate in unused
channels [5]. The IEEE has also formed the 802.22 working group to develop the standard for
wireless regional area networks (WRAN), which will operate on unused VHF/UHF TV bands
based on cognitive radio technology. Both of these activities will significantly change the current
wireless communication situation.

As mentioned above, the secondary users need to opportunistically access the unused licensed
channel while causing negligible interference to the primary users. As a result, detecting the
presence of primary users is a fundamental and critical task in cognitive radio networks.
Although detecting the presence of signals is a classical problem in signal processing,
sensing the presence of primary users in a complicated communication environment,
especially a CR-based network, is still a challenging problem from a practical perspective. This
is mainly due to the following two limiting factors. Firstly, it is very difficult, if not impossible,
for the secondary user to obtain the prior information about the signal characteristics
of the primary user that most traditional detection techniques require. Secondly, the CR
devices should be capable of sensing the very weak signals transmitted by primary users. For
instance, the standard released by the FCC requires that spectrum sensing algorithms
reliably detect the transmitted TV signals at a signal-to-noise ratio (SNR) as low as
-18 dB [4].

Thus far, there are mainly four types of spectrum sensing methods: energy detection [6], [7],
matched filtering (coherent detection) [8], feature detection [9] and eigenvalue-based detection
[10]-[12]. Among them, energy detection is optimal if the secondary user only knows the local
noise power [13]. The matched-filtering based coherent detection is optimal for maximizing the
detection probability but it requires the explicit knowledge of the transmitted signal pattern (e.g.,
pilot, training sequence etc.) of the primary user. The feature detection, often referred to as the
cyclostationary detection, exploits the periodicity in the modulation scheme which, however,
is difficult to determine in certain scenarios. By constructing the decision variables based on
eigenvalues of the sampled covariance matrix to detect the presence of the primary user, the
eigenvalue-based sensing methods presented in [10]-[12] do not need to estimate the power of
the noise and hence are more practical in most CR networks. Recently, several new spectrum
sensing schemes incorporating system-level design parameters have been introduced, such
as throughput maximization [14]-[16] and cooperative sensing using multiple nodes [17]-[20].
Nevertheless, the aforementioned four types of sensing techniques are still treated as a basic
component in these new schemes.

In this paper, we study a blind spectrum sensing method based on information theoretic
criteria (ITC), an approach to model selection originally introduced by Akaike [21], [22] and
by Schwarz [23] and Rissanen [24]. Applying information theoretic criteria to spectrum sensing
was first introduced in [25]-[28]. This work provides a more in-depth study of the ITC sensing
algorithm and its performance. The main contributions of this paper are as follows:

• First of all, to make the information theoretic criteria applicable, a new over-determined
channel model is constructed by introducing multiple antennas or over-sampling at the
secondary user.

• Then, a simplified information theoretic criteria (SITC) sensing algorithm which only involves
the computation of two decision values is presented. Compared to the original information
theoretic criteria (OITC) sensing algorithm in [25], SITC is much less complex and yet has
almost no performance loss. Simulation results also demonstrate that the proposed SITC based
spectrum sensing outperforms the eigenvalue based sensing algorithm in [10] and achieves
nearly the same performance as [11]. The proposed sensing algorithm also enables a
more tractable analytical study on the detection performance.

• Applying the recent advances in random matrix theory, we then derive closed-form
expressions for both the probability of false alarm and probability of detection, which can
approximate the actual results in simulation very well.

• Finally, based on the insight derived from the analytical study, we further present a
generalized information theoretic criteria (GITC) sensing algorithm. By involving an adjustable
threshold, the proposed GITC can provide a flexible tradeoff between the probability of
detection and probability of false alarm in order to meet different system requirements.

The rest of the paper is organized as follows. In Section II, the preliminary on the information
theoretic criteria is provided. The proposed over-determined system model is presented in Section
III. Section IV gives the proposed SITC sensing algorithm and the theoretical analysis of
its detection performance, followed by the GITC sensing algorithm in Section V. Extensive
simulation results are illustrated in Section VI. Finally, Section VII offers some concluding
remarks.

Notations: $\mathcal{E}[\cdot]$ denotes expectation over the random variables within the brackets. $\mathrm{Tr}(\mathbf{A})$ stands
for the trace of matrix $\mathbf{A}$. Superscripts $(\cdot)^T$ and $(\cdot)^{\dagger}$ denote transpose and conjugate transpose.

II. Preliminary on the Information Theoretic Criteria

Information theoretic criteria are an approach to model selection originally introduced by
Akaike [21], [22] and by Schwarz [23] and Rissanen [24]. There are two well-known criteria
that have been widely used: the Akaike information criterion (AIC) and the minimum description length
(MDL) criterion. One of the most important applications of information theoretic criteria is to
estimate the number of source signals in array signal processing [29]. Consider a system model
described as

$$\mathbf{x} = \mathbf{A}\mathbf{s} + \boldsymbol{\mu}, \tag{1}$$

where $\mathbf{x}$ is the $p \times 1$ complex observation vector, $\mathbf{A}$ is a $p \times q$ ($p > q$) complex system matrix,
$\mathbf{s}$ denotes the $q \times 1$ complex source modulated signals and $\boldsymbol{\mu}$ is the additive complex white
Gaussian noise vector with variance $\sigma^2$. It is noted that the parameters $q$, $\mathbf{A}$ and $\sigma^2$ are all unknown. The
resulting cost functions of AIC and MDL have the following form [29]:

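$$\mathrm{AIC}(k) = -2\log\left(\frac{\prod_{i=k+1}^{p} l_i^{1/(p-k)}}{\frac{1}{p-k}\sum_{i=k+1}^{p} l_i}\right)^{(p-k)N} + 2k(2p-k) + 2, \tag{2}$$

$$\mathrm{MDL}(k) = -\log\left(\frac{\prod_{i=k+1}^{p} l_i^{1/(p-k)}}{\frac{1}{p-k}\sum_{i=k+1}^{p} l_i}\right)^{(p-k)N} + \left[\frac{k}{2}(2p-k) + \frac{1}{2}\right]\log N, \tag{3}$$

where $N$ is the number of observations and $l_i$ denotes the $i$-th eigenvalue, in decreasing order,
of the sampled covariance matrix. The estimated number of source signals is the value of $k$ that
minimizes (2) or (3). That is,

$$\hat{k}_{\mathrm{AIC}} = \arg\min_{j=0,1,\ldots,p-1} \mathrm{AIC}(j), \tag{4}$$

$$\hat{k}_{\mathrm{MDL}} = \arg\min_{j=0,1,\ldots,p-1} \mathrm{MDL}(j). \tag{5}$$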
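To make (2)-(5) concrete, the following is a minimal Python sketch that evaluates both cost functions from a given list of eigenvalues and returns the minimizing indices. The function names `itc_costs` and `estimate_num_sources` and the example eigenvalues are illustrative assumptions rather than part of the original work; the eigenvalues are assumed to be strictly positive.

```python
import math

def itc_costs(eigenvalues, n_obs):
    """Evaluate AIC(k) and MDL(k) of (2) and (3) for k = 0, ..., p-1."""
    l = sorted(eigenvalues, reverse=True)      # l_1 >= l_2 >= ... >= l_p
    p = len(l)
    aic, mdl = [], []
    for k in range(p):
        tail = l[k:]                           # l_{k+1}, ..., l_p
        m = p - k
        # log of (geometric mean / arithmetic mean) of the m smallest eigenvalues
        log_ratio = sum(math.log(v) for v in tail) / m - math.log(sum(tail) / m)
        log_lik = m * n_obs * log_ratio        # log( . )^{(p-k)N}
        aic.append(-2.0 * log_lik + 2.0 * k * (2 * p - k) + 2.0)
        mdl.append(-log_lik + (0.5 * k * (2 * p - k) + 0.5) * math.log(n_obs))
    return aic, mdl

def estimate_num_sources(eigenvalues, n_obs):
    """Return (k_AIC, k_MDL), the minimizers in (4) and (5)."""
    aic, mdl = itc_costs(eigenvalues, n_obs)
    return aic.index(min(aic)), mdl.index(min(mdl))

# Hypothetical eigenvalues: one dominant value over a flat noise floor.
k_aic, k_mdl = estimate_num_sources([10.0, 1.0, 1.0, 1.0], n_obs=1000)
```

When all eigenvalues are equal, the log-likelihood term vanishes for every $k$ and only the penalty terms remain, so both criteria select $k = 0$.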



III. System Model

We consider a multipath fading channel model and assume that there is only one primary
user in the cognitive radio network. Let $x(t)$ be a continuous-time baseband received signal at
the secondary user's receiver. Spectrum sensing can be formulated as a binary hypothesis test
between the following two hypotheses:

$$\mathcal{H}_0: x(t) = \mu(t), \tag{6}$$

$$\mathcal{H}_1: x(t) = \int_0^{T} h(\tau)\,s(t-\tau)\,d\tau + \mu(t), \tag{7}$$

where $s(t)$ denotes the signal transmitted by the primary user, $h(t)$ is the continuous channel
response between the primary transmitter and the secondary receiver, $\mu(t)$ denotes the additive
white noise, and the parameter $T$ signifies the duration of the channel. The channel response is also
assumed to remain invariant during each observation. To obtain the discrete representation, we
assume that the received signal is sampled at rate $f_s$, which is equal to the reciprocal of the
baseband symbol duration $T_0$. For notational simplicity, we define $x(n) = x(nT_0)$, $s(n) = s(nT_0)$
and $\mu(n) = \mu(nT_0)$. Hence, the corresponding received signal samples under the two hypotheses
are described as:

$$\mathcal{H}_0: x(n) = \mu(n), \tag{8}$$

$$\mathcal{H}_1: x(n) = \sum_{i=0}^{L-1} h(i)\,s(n-i) + \mu(n), \tag{9}$$

where $h(i)$ ($0 \le i \le L-1$) denotes the discrete channel response of $h(t)$ and $L$ denotes
the order of the discrete channel ($L$ taps). Let each observation consist of $M$ received signal
samples. Then (8) and (9) can be rewritten in matrix form as:

$$\mathcal{H}_0: \mathbf{x}_i = \boldsymbol{\mu}_i, \tag{10}$$

$$\mathcal{H}_1: \mathbf{x}_i = \mathbf{H}\mathbf{s}_i + \boldsymbol{\mu}_i, \tag{11}$$

where $\mathbf{H}$ is an $M \times (L+M-1)$ circular channel matrix defined as

$$\mathbf{H} = \begin{bmatrix}
h(L-1) & h(L-2) & \cdots & h(0) & & & \\
 & h(L-1) & h(L-2) & \cdots & h(0) & & \\
 & & \ddots & \ddots & & \ddots & \\
 & & & h(L-1) & h(L-2) & \cdots & h(0)
\end{bmatrix},$$

with zeros in all unmarked entries. $\mathbf{x}_i$, $\mathbf{s}_i$ and $\boldsymbol{\mu}_i$ are the $M \times 1$ observation vector, the $(L+M-1) \times 1$ source signal vector and the $M \times 1$
noise vector, respectively, and are defined as

$$\mathbf{x}_i = [x(iM-M+1),\, x(iM-M+2),\, \ldots,\, x(iM)]^T, \tag{12}$$

$$\mathbf{s}_i = [s(iM-M-L+2),\, s(iM-M-L+3),\, \ldots,\, s(iM)]^T, \tag{13}$$

$$\boldsymbol{\mu}_i = [\mu(iM-M+1),\, \mu(iM-M+2),\, \ldots,\, \mu(iM)]^T. \tag{14}$$

Now, comparing (11) with the array signal processing model (1), we find that a major difference
is that the $\mathbf{H}$ in our considered system model is an under-determined matrix, i.e., the number
of columns is larger than the number of rows. Therefore, the information theoretic criteria are not
directly applicable here [29].
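The structure of the single-antenna model can be checked numerically. The sketch below builds the $M \times (L+M-1)$ channel matrix row by row and confirms that multiplying it by the source vector of (13) reproduces the convolution in (9) without noise. The helper names (`channel_matrix`, `matvec`, `direct_convolution`) and the tap and symbol values are hypothetical and chosen only for illustration.

```python
def channel_matrix(h, M):
    """M x (L+M-1) matrix; row m holds h(L-1), ..., h(0) starting at column m."""
    L = len(h)
    H = [[0.0] * (L + M - 1) for _ in range(M)]
    for m in range(M):
        for i in range(L):
            H[m][m + L - 1 - i] = h[i]
    return H

def matvec(H, s):
    """Plain matrix-vector product H s."""
    return [sum(a * b for a, b in zip(row, s)) for row in H]

def direct_convolution(h, s, M):
    """x(n) = sum_i h(i) s(n-i) for the M output samples covered by s."""
    L = len(h)
    return [sum(h[i] * s[m + L - 1 - i] for i in range(L)) for m in range(M)]

# Hypothetical 3-tap channel, M = 4 samples, BPSK-like source of length L+M-1 = 6.
h = [1.0, 0.5, 0.25]
s = [1.0, -1.0, 1.0, 1.0, -1.0, 1.0]
H = channel_matrix(h, 4)
x_matrix = matvec(H, s)
x_direct = direct_convolution(h, s, 4)
```

Here `H` has 4 rows and 6 columns, which illustrates the under-determined shape noted above: enlarging $M$ adds one row and one column at a time, so the column count always stays ahead by $L - 1$.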

To construct an over-determined channel matrix $\mathbf{H}$ as in (1), one needs to enlarge the
observation space. Obviously, simply increasing the observation window $M$ does not work. Here
we propose to expand the observation space using one of the following two methods. One is to
increase the spatial dimensionality by employing multiple receive antennas at the secondary user
and the other is to increase the time dimensionality by over-sampling the received signals. It
turns out that the two methods are similar to each other. Hence we shall focus on the
multiple-antenna approach hereafter. The difference for the over-sampling method will be discussed at the
end of this section. Specifically, suppose that the detector at the secondary user is equipped with
$K$ antennas. Redefine (12) and (14) as

$$\mathbf{x}_i = [x_1^i(1),\, x_2^i(1),\, \ldots,\, x_K^i(1),\, x_1^i(2),\, \ldots,\, x_K^i(2),\, \ldots,\, x_1^i(M),\, \ldots,\, x_K^i(M)]^T, \tag{15}$$

$$\boldsymbol{\mu}_i = [\mu_1^i(1),\, \mu_2^i(1),\, \ldots,\, \mu_K^i(1),\, \mu_1^i(2),\, \ldots,\, \mu_K^i(2),\, \ldots,\, \mu_1^i(M),\, \ldots,\, \mu_K^i(M)]^T, \tag{16}$$

where $\mathbf{x}_k^i = [x_k^i(1),\, x_k^i(2),\, \ldots,\, x_k^i(M)]^T$ represents the $M \times 1$ observation vector at the
$k$-th antenna at the $i$-th observation as in (12), and $\boldsymbol{\mu}_k^i = [\mu_k^i(1),\, \mu_k^i(2),\, \ldots,\, \mu_k^i(M)]^T$ is the
corresponding noise vector at the $k$-th antenna at the $i$-th observation as in (14). Then, the
new channel matrix $\mathbf{H}$ becomes an $MK \times (M+L-1)$ matrix:

$$\mathbf{H} = \begin{bmatrix}
h_1(L-1) & h_1(L-2) & \cdots & h_1(0) & & & \\
\vdots & \vdots & & \vdots & & & \\
h_K(L-1) & h_K(L-2) & \cdots & h_K(0) & & & \\
 & h_1(L-1) & h_1(L-2) & \cdots & h_1(0) & & \\
 & \vdots & \vdots & & \vdots & & \\
 & h_K(L-1) & h_K(L-2) & \cdots & h_K(0) & & \\
 & & \ddots & \ddots & & \ddots & \\
 & & & h_1(L-1) & h_1(L-2) & \cdots & h_1(0) \\
 & & & \vdots & \vdots & & \vdots \\
 & & & h_K(L-1) & h_K(L-2) & \cdots & h_K(0)
\end{bmatrix}. \tag{17}$$

Here, $h_k(i)$, for $i = 0, \ldots, L-1$, denotes the $i$-th channel tap observed at the $k$-th antenna. To
ensure that $\mathbf{H}$ is now an over-determined matrix (the number of rows is larger than the number of
columns), we need to have

$$K > \frac{L+M-1}{M}, \tag{18}$$

or, alternatively,

$$M > \frac{L-1}{K-1}. \tag{19}$$

Furthermore, we assume that the noise samples coming from different antennas are independent with
zero mean and $\mathcal{E}(\boldsymbol{\mu}_i\boldsymbol{\mu}_i^{\dagger}) = \sigma^2\mathbf{I}_{MK}$. Then the system model under
multiple antennas satisfies the over-determined condition specified in [29]. For ease of presentation,
we define $p = MK$ and $q = L + M - 1$ in (11).
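As an illustration of (15)-(19), the sketch below stacks per-antenna taps into the $MK \times (M+L-1)$ matrix of (17) and checks the over-determined condition. The function names and the tap values are assumptions made for this example only; `min_window` assumes $K \ge 2$.

```python
def stacked_channel_matrix(h, M):
    """h[k][i] is the i-th tap at antenna k. Rows follow the ordering of (15):
    all K antennas for sample 1, then all K antennas for sample 2, and so on."""
    K, L = len(h), len(h[0])
    H = [[0.0] * (M + L - 1) for _ in range(M * K)]
    for m in range(M):
        for k in range(K):
            for i in range(L):
                H[m * K + k][m + L - 1 - i] = h[k][i]
    return H

def is_overdetermined(M, K, L):
    """True when p = MK exceeds q = L + M - 1, i.e. when (18) holds."""
    return M * K > L + M - 1

def min_window(K, L):
    """Smallest integer M satisfying (19), M > (L-1)/(K-1), for K >= 2."""
    return (L - 1) // (K - 1) + 1

# Hypothetical example: K = 2 antennas, L = 2 taps, M = 2 samples per observation.
H = stacked_channel_matrix([[1.0, 2.0], [3.0, 4.0]], M=2)
```

With $K = 1$ the check fails for every $M$ whenever $L > 1$, which matches the remark that enlarging the observation window alone does not help.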

As mentioned earlier, the second approach to construct the over-determined channel model
is for the secondary user to over-sample the received signals. Suppose that the over-sampling
factor is given by $K$. That is, the received baseband signal is sampled $K$ times in one symbol.
Then a similar system model as in (15), (16) and (17) can be obtained, except that $\mathbf{x}_i$ and $\boldsymbol{\mu}_i$
should be replaced with

$$\mathbf{x}_i = [x(iMK-MK+1),\, x(iMK-MK+2),\, \ldots,\, x(iMK)]^T, \tag{20}$$

$$\boldsymbol{\mu}_i = [\mu(iMK-MK+1),\, \mu(iMK-MK+2),\, \ldots,\, \mu(iMK)]^T, \tag{21}$$

and $h_k(i)$, for $i = 0, \ldots, L-1$, becomes the $k$-th over-sampling point of the $i$-th channel tap.
It can be verified that the $h_k(i)$'s are different for different $k$ [30]. The major difference between
the over-sampling approach and the multiple-antenna approach is that the over-sampled noise
samples in (21) are correlated, which contradicts the primary assumption of independent noise
samples. Nevertheless, the pre-whitening technique can be used to whiten the correlated noise
based on the known correlation matrix. The details can be found in Appendix A.

Before leaving this section, we note that, although the proposed over-determined model is
based on the assumption that there is only one primary user in the cognitive network, it is
also applicable to the scenario where there exist multiple primary users. An alternative approach
to construct the over-determined model in the presence of multiple primary users is to use the
cooperative sensing technique as in [28] by using multiple detectors.

IV. Simplified Information Theoretic Criteria Sensing Algorithm and Performance Analysis

Since the binary hypothesis test in spectrum sensing is equivalent to a special case of the
source number estimation problem, the information theoretic criteria method can be directly
applied to conduct spectrum sensing, as first proposed in [25]-[28]. The basic idea is that when the
primary user is absent, the received signal $\mathbf{x}_i$ consists only of white noise samples. Therefore, the
estimated number of source signals via information theoretic criteria (AIC or MDL) should be
zero. Otherwise, when the primary user is present, the number of source signals must be larger
than zero. Hence, by comparing the estimated number of source signals with zero, the presence
of the primary user can be detected. It is noted that estimating the number of sources by
using (2) and (3) needs very little prior information about the primary user. In particular, it does
not require knowledge of channel state information, synchronization, pilot design or
modulation strategy. Moreover, it does not need the estimation of noise power. Hence we argue
that the information theoretic criteria method is a blind spectrum sensing method similar to [10]-[12], and
that it is robust and suitable for practical applications.

However, it is known that signal detection is much easier than signal estimation. Therefore,
using the estimation method to conduct the detection as in [25]-[28] may lead to unnecessary
computational complexity overhead. At the same time, it makes it difficult to carry out an analytical
study of the detection performance. In this section, we propose a simplified ITC algorithm to
conduct the spectrum sensing. It can significantly reduce the computational complexity while
having almost no performance loss, as will be illustrated in Section VI. It also enables a more
tractable analytical study on the detection performance.

A. Simplified ITC sensing algorithm 

Before presenting the simplified ITC sensing algorithm in detail, we have the following lemma.
Lemma 1: If there is one value $k$ ($k > 0$) which minimizes the AIC metric in (2) (MDL metric
in (3)), then AIC(0) > AIC(1) (MDL(0) > MDL(1)) with high probability.

Proof: Please refer to Appendix B. ■
The outline of the proposed simplified sensing algorithm is as follows.

Algorithm 1: SITC sensing algorithm

Step 1. Compute the sampled covariance matrix of received signals, i.e., $\mathbf{R}_x = \frac{1}{N}\sum_{i=1}^{N}\mathbf{x}_i\mathbf{x}_i^{\dagger}$,
where the $\mathbf{x}_i$'s are received vectors as described in (12) or (20) and $N$ denotes the number of
observations.

Step 2. Obtain the eigenvalues of $\mathbf{R}_x$ through eigenvalue decomposition, and
denote them as $\{l_1, l_2, \ldots, l_p\}$ with $l_1 \ge l_2 \ge \cdots \ge l_p$.

Step 3. Calculate the decision values AIC(0) and AIC(1) (MDL(0) and MDL(1)) according
to (2) ((3)). Then the detection decision metric is

$$T_{\mathrm{SITC\text{-}AIC}}(\mathcal{L}_x):\ \mathrm{AIC}(0) \underset{\mathcal{H}_0}{\overset{\mathcal{H}_1}{\gtrless}} \mathrm{AIC}(1) \tag{22}$$

if AIC is adopted, or

$$T_{\mathrm{SITC\text{-}MDL}}(\mathcal{L}_x):\ \mathrm{MDL}(0) \underset{\mathcal{H}_0}{\overset{\mathcal{H}_1}{\gtrless}} \mathrm{MDL}(1) \tag{23}$$

if MDL is adopted, where $\mathcal{L}_x$ denotes the set of eigenvalues $\{l_i,\ i = 1, 2, \ldots, p\}$.

Note that in the OITC sensing algorithm [25], one needs to search over all values of $k$ from
0 to $p-1$ to minimize the AIC in (2) or MDL in (3). In the proposed SITC algorithm, only
two decision values, at $k = 0$ and $k = 1$, need to be computed and compared. Thus, the computational
complexity is significantly reduced. In the next subsection, based on the proposed SITC
algorithm, we present the analytical results on the detection performance. Since, by Lemma 1,
the SITC algorithm obtains almost the same performance as the OITC algorithm, we claim that our
analytical results are also applicable for evaluating the performance of the OITC algorithm.
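Step 3 of Algorithm 1 can be sketched as follows, alongside the full OITC search for comparison. This is only an illustrative sketch: it takes the eigenvalues from Step 2 as input (any Hermitian eigenvalue routine could supply them), and the names `itc_cost`, `sitc_detect` and `oitc_detect` are hypothetical.

```python
import math

def itc_cost(l, k, n_obs, criterion="aic"):
    """AIC(k) from (2) or MDL(k) from (3); l must be sorted in decreasing order."""
    p = len(l)
    tail = l[k:]
    m = p - k
    log_ratio = sum(math.log(v) for v in tail) / m - math.log(sum(tail) / m)
    log_lik = m * n_obs * log_ratio
    if criterion == "aic":
        return -2.0 * log_lik + 2.0 * k * (2 * p - k) + 2.0
    return -log_lik + (0.5 * k * (2 * p - k) + 0.5) * math.log(n_obs)

def sitc_detect(eigenvalues, n_obs, criterion="aic"):
    """Decide H1 when the cost at k = 0 exceeds the cost at k = 1, as in (22)/(23)."""
    l = sorted(eigenvalues, reverse=True)
    return itc_cost(l, 0, n_obs, criterion) > itc_cost(l, 1, n_obs, criterion)

def oitc_detect(eigenvalues, n_obs, criterion="aic"):
    """Decide H1 when the minimizing k over 0..p-1 is nonzero (full search)."""
    l = sorted(eigenvalues, reverse=True)
    costs = [itc_cost(l, k, n_obs, criterion) for k in range(len(l))]
    return costs.index(min(costs)) > 0
```

The simplified rule evaluates two costs instead of $p$, which is where the complexity saving comes from; the eigenvalue decomposition in Step 2 is needed in both cases.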
B. Performance Analysis

Since spectrum sensing is actually a binary hypothesis test, the performance we focus on is
the probability of detection $P_d$ (the probability of identifying the signal when the primary user
is present) and the probability of false alarm $P_f$ (the probability of identifying the signal when
the primary user is absent). As no threshold value is involved in the ITC sensing algorithm,
$P_d$ is not directly related to $P_f$. The two probabilities are presented separately. For ease of
presentation, we shall take the AIC criterion as an example to illustrate the analysis throughout
this section. The extension to the MDL criterion is straightforward unless mentioned otherwise.

1) Probability of false alarm: According to the sensing steps in Algorithm 1, a false alarm
occurs when AIC(0) is larger than AIC(1) under hypothesis $\mathcal{H}_0$. The probability of false alarm can
be expressed as

$$P_{f\text{-AIC}} = \Pr(\mathrm{AIC}(0) > \mathrm{AIC}(1) \mid \mathcal{H}_0). \tag{24}$$

Since the primary user is absent, the received signal $\mathbf{x}_i$ only contains the noise. The sampled
covariance matrix in Algorithm 1 thus reduces to

$$\mathbf{R}_{\mu} = \frac{1}{N}\sum_{i=1}^{N}\boldsymbol{\mu}_i\boldsymbol{\mu}_i^{\dagger}. \tag{25}$$

Hence, the eigenvalues in (2) become the eigenvalues of the sampled noise covariance matrix
$\mathbf{R}_{\mu}$ in (25), which is a Wishart random matrix [31]. By applying the recent advances on the
eigenvalue distribution for Wishart matrices, a closed-form expression for the probability of false
alarm can be obtained.

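Proposition 1: The probability of false alarm of the proposed spectrum sensing algorithm can
be approximated as:

$$\begin{aligned}
P_f \approx{} & F_2\!\left(\frac{pN - (\sqrt{N}+\sqrt{p})^2}{(\sqrt{N}+\sqrt{p})\left(\frac{1}{\sqrt{N}}+\frac{1}{\sqrt{p}}\right)^{1/3}}\right) - F_2\!\left(\frac{(p-\alpha_1)N - (\sqrt{N}+\sqrt{p})^2}{(\sqrt{N}+\sqrt{p})\left(\frac{1}{\sqrt{N}}+\frac{1}{\sqrt{p}}\right)^{1/3}}\right) \\
& + F_2\!\left(\frac{(p-\alpha_2)N - (\sqrt{N}+\sqrt{p})^2}{(\sqrt{N}+\sqrt{p})\left(\frac{1}{\sqrt{N}}+\frac{1}{\sqrt{p}}\right)^{1/3}}\right) - F_2\!\left(\frac{-(\sqrt{N}+\sqrt{p})^2}{(\sqrt{N}+\sqrt{p})\left(\frac{1}{\sqrt{N}}+\frac{1}{\sqrt{p}}\right)^{1/3}}\right),
\end{aligned} \tag{26}$$

where $F_2(\cdot)$ is the cumulative distribution function (CDF) of the Tracy-Widom distribution of order
two [31], and $\alpha_1$ and $\alpha_2$, with $\alpha_1 < \alpha_2$, are the two real roots of the function in (32) if AIC is applied,
or (37) if MDL is applied.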
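Proof: Recalling the definition in (24), computing the probability of false alarm amounts to computing
the probability

$$P_{f\text{-AIC}} = \Pr(\mathrm{AIC}(0) - \mathrm{AIC}(1) > 0 \mid \mathcal{H}_0). \tag{27}$$

According to the cost function of AIC defined in (2), we have

$$\mathrm{AIC}(0) - \mathrm{AIC}(1) = -2\log\left(\frac{\prod_{i=1}^{p} l_i^{1/p}}{\frac{1}{p}\sum_{i=1}^{p} l_i}\right)^{pN} + 2\log\left(\frac{\prod_{i=2}^{p} l_i^{1/(p-1)}}{\frac{1}{p-1}\sum_{i=2}^{p} l_i}\right)^{(p-1)N} - (4p-2).$$

Then we can rewrite (27) as

$$P_{f\text{-AIC}} = \Pr\left[\log\frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}{l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}} > \frac{4p-2}{2N} \,\middle|\, \mathcal{H}_0\right]. \tag{28}$$

Note here that the average of the eigenvalues of the sampled covariance matrix, $\frac{1}{p}\sum_{i=1}^{p} l_i$, is equivalent to
$\frac{1}{p}\mathrm{Tr}\left(\frac{1}{N}\sum_{i=1}^{N}\mathbf{x}_i\mathbf{x}_i^{\dagger}\right)$. Under hypothesis $\mathcal{H}_0$, where the received vector involves only the noise
samples, $\frac{1}{p}\mathrm{Tr}\left(\frac{1}{N}\sum_{i=1}^{N}\boldsymbol{\mu}_i\boldsymbol{\mu}_i^{\dagger}\right)$ is the unbiased estimate of the variance of the white noise.
Therefore, when $N$ is sufficiently large, we have

$$\frac{1}{p}\sum_{i=1}^{p} l_i \approx \sigma^2. \tag{29}$$

Substituting (29) into (28) yields

$$P_{f\text{-AIC}} \approx \Pr\left[\frac{(\sigma^2)^{p}}{l_1\left(\frac{p\sigma^2 - l_1}{p-1}\right)^{p-1}} > \exp\left(\frac{2p-1}{N}\right) \,\middle|\, \mathcal{H}_0\right]. \tag{30}$$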



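From (30), it is seen that the probability of false alarm depends only on the largest eigenvalue
of the sampled noise covariance matrix $\mathbf{R}_{\mu}$. Since $\mathbf{R}_{\mu}$ is actually a Wishart random matrix, its
largest eigenvalue $l_1$, suitably centered and scaled, follows the Tracy-Widom distribution of order two [31]. To apply this
result, we rewrite (30) as

$$\begin{aligned}
P_{f\text{-AIC}} &\approx \Pr\left[\frac{l_1}{\sigma^2}\left(p - \frac{l_1}{\sigma^2}\right)^{p-1} < \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} \,\middle|\, \mathcal{H}_0\right] \\
&= \Pr\left[x^{p} - p x^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} > 0 \,\middle|\, \mathcal{H}_0\right],
\end{aligned} \tag{31}$$

where $x = p - \frac{l_1}{\sigma^2}$.
Define a function

$$f(x) \triangleq x^{p} - p x^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)}. \tag{32}$$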
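The step from (30) to (31) is compact, so a short worked version may help. The following is a sketch of the algebra under the substitution $x = p - l_1/\sigma^2$, restricted to $0 < x < p$ so that all factors are positive; it introduces no quantities beyond those already defined.

```latex
% Divide numerator and denominator of the ratio in (30) by \sigma^{2p}
% and use l_1/\sigma^2 = p - x, (p\sigma^2 - l_1)/\sigma^2 = x:
\frac{(\sigma^2)^{p}}{l_1\left(\frac{p\sigma^2 - l_1}{p-1}\right)^{p-1}}
  = \frac{(p-1)^{p-1}}{(p-x)\,x^{p-1}} .
% The event in (30) is therefore
(p-x)\,x^{p-1} < \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)}
\;\Longleftrightarrow\;
x^{p} - p\,x^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} > 0 ,
% which is the condition f(x) > 0 with f as in (32).
```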



We next find the real roots of this function.
Taking the derivative of $f(x)$ and equating it to zero, we obtain

$$\frac{d f(x)}{dx} = p x^{p-1} - p(p-1)x^{p-2} = p x^{p-2}\left[x - (p-1)\right] = 0.$$

Clearly, $f(x)$ has two stationary points, which are $x = p-1$ and $x = 0$. In the following, the two
scenarios with $p$ being even or odd are considered respectively. When $p$ is even, it is easily found
that the function $f(x)$ monotonically decreases over $(-\infty, p-1)$ and monotonically increases
over $(p-1, \infty)$. We can also verify that

$$f(p-1) = (p-1)^{p} - p(p-1)^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} < 0 \tag{33}$$

and

$$f(0) = f(p) = \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} > 0. \tag{34}$$

So there must be two real roots within $(0, p)$, located around $p-1$, for the function $f(x)$. Let $\alpha_1$
and $\alpha_2$, with $\alpha_1 < \alpha_2$, denote the two real roots. Then (31) is converted into an equivalent form:

$$P_{f\text{-AIC}} \approx \Pr\left[x < \alpha_1 \mid \mathcal{H}_0\right] + \Pr\left[\alpha_2 < x \mid \mathcal{H}_0\right]. \tag{35}$$

When $p$ is odd, we find that $f(x)$ decreases monotonically over $(0, p-1)$, while it is
monotonically increasing over both $(-\infty, 0)$ and $(p-1, \infty)$. Since $f(-\infty) < 0$,
$f(0) > 0$, $f(p-1) < 0$ and $f(p) > 0$, we conclude that $f(x)$ has three real roots, which
are denoted as $\alpha_0$, $\alpha_1$ and $\alpha_2$, with $\alpha_0 < 0$ and $0 < \alpha_1 < \alpha_2$, respectively. Then (31) can be
rewritten as:

$$P_{f\text{-AIC}} \approx \Pr\left[\alpha_0 < x < \alpha_1 \mid \mathcal{H}_0\right] + \Pr\left[\alpha_2 < x \mid \mathcal{H}_0\right]. \tag{36}$$

However, it is noted that when $N$ is large enough, the largest eigenvalue of the sampled noise
covariance matrix, $l_1$, is just slightly larger than the true noise variance $\sigma^2$. Hence, from
the definition, $x$ can be reasonably limited to $(0, p)$. Therefore, both (35) and
(36) can be summarized in the following form:

$$P_{f\text{-AIC}} \approx \Pr\left[0 < x < \alpha_1 \mid \mathcal{H}_0\right] + \Pr\left[\alpha_2 < x < p \mid \mathcal{H}_0\right].$$



In other words,

$$P_{f\text{-AIC}} \approx \Pr\left[p - \alpha_1 < \frac{l_1}{\sigma^2} < p\right] + \Pr\left[0 < \frac{l_1}{\sigma^2} < p - \alpha_2\right].$$

Applying the distribution of the largest eigenvalue of a Wishart matrix in random matrix theory
[31], the variable $N l_1/\sigma^2$, after centering and scaling, follows the Tracy-Widom distribution of order two, i.e.,

$$\frac{N l_1/\sigma^2 - (\sqrt{N}+\sqrt{p})^2}{(\sqrt{N}+\sqrt{p})\left(\frac{1}{\sqrt{N}}+\frac{1}{\sqrt{p}}\right)^{1/3}} \sim W_2.$$

Here, $W_2$ and $F_2$ denote the probability density function (PDF) and cumulative distribution function
(CDF) of the Tracy-Widom distribution of order two, respectively. Therefore, the probability of
false alarm of AIC can be concluded as (26).

Similarly to the above derivation, when the MDL criterion is applied, we only need to modify
the step in (31) as

$$P_{f\text{-MDL}} \approx \Pr\left[x^{p} - p x^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{(p-0.5)\log N}{N}\right)} > 0 \,\middle|\, \mathcal{H}_0\right],$$

and redefine the function $f(x)$ in (32) as

$$f(x) \triangleq x^{p} - p x^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{(p-0.5)\log N}{N}\right)}. \tag{37}$$

From Proposition 1, it is found that the probability of false alarm is independent of the noise
variance $\sigma^2$. Therefore, the proposed SITC sensing algorithm is robust in practical
applications. It is also noted that $P_f$ depends on the product of $M$ and $K$, i.e., $p = MK$, rather than
the individual values of $M$ and $K$.
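The roots $\alpha_1$ and $\alpha_2$ used in Proposition 1 have no closed form, but they are easy to find numerically. The sketch below is one possible approach, not the authors' procedure: on $(0, p)$ the condition $f(x) > 0$ is rewritten in logarithmic form to avoid large powers, and each root is bracketed on one side of the stationary point $p - 1$ and located by bisection. All function names are hypothetical, and evaluating (26) itself would additionally require a Tracy-Widom CDF routine, which is not included here.

```python
import math

def log_gamma_aic(p, n_obs):
    """log of the AIC constant exp((2p-1)/N) appearing in (32)."""
    return (2 * p - 1) / n_obs

def log_gamma_mdl(p, n_obs):
    """log of the MDL constant exp((p-0.5) log N / N) appearing in (37)."""
    return (p - 0.5) * math.log(n_obs) / n_obs

def bisect_root(func, lo, hi, iters=200):
    """Plain bisection; assumes func changes sign exactly once on [lo, hi]."""
    f_lo = func(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def itc_roots(p, log_gamma, eps=1e-12):
    """Roots alpha_1 < p-1 < alpha_2 of f(x) within (0, p), for p >= 2."""
    def g(x):
        # For 0 < x < p, g(x) has the same sign as f(x):
        # g(x) = (p-1) log((p-1)/x) - log(p-x) - log(gamma)
        return (p - 1) * math.log((p - 1) / x) - math.log(p - x) - log_gamma
    if g(eps) <= 0 or g(p - 1.0) >= 0 or g(p - eps) <= 0:
        raise ValueError("roots are not bracketed for these parameters")
    return bisect_root(g, eps, p - 1.0), bisect_root(g, p - 1.0, p - eps)

# Example with p = MK = 20 and N = 10000 (values chosen for illustration).
alpha_1, alpha_2 = itc_roots(20, log_gamma_aic(20, 10000))
```

Because the MDL constant is larger than the AIC constant for these parameters, the MDL roots lie farther from $p - 1$, which narrows the false-alarm region in (35).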

2) Probability of detection: When the primary user is present, the event of detection also
occurs when AIC(0) > AIC(1). The probability of detection is thus described as

$$P_{d\text{-AIC}} = \Pr(\mathrm{AIC}(0) > \mathrm{AIC}(1) \mid \mathcal{H}_1). \tag{38}$$

Since under $\mathcal{H}_1$ the received vector $\mathbf{x}_i$ involves the signals transmitted by the primary user, the
sampled covariance matrix $\mathbf{R}_x$ can be written as

$$\mathbf{R}_x = \frac{1}{N}\sum_{i=1}^{N}(\mathbf{H}\mathbf{s}_i + \boldsymbol{\mu}_i)(\mathbf{H}\mathbf{s}_i + \boldsymbol{\mu}_i)^{\dagger}. \tag{39}$$

Note that $\mathbf{R}_x$ is no longer a Wishart matrix. The exact distribution of its eigenvalues is unknown
and difficult to find, and hence so is $P_d$. In the following, we resort to deriving a closed-form
expression for the conditional probability of detection given the channel matrix $\mathbf{H}$. The average
probability of detection can then be obtained using a hybrid analytical-simulation approach.
Proposition 2: Let $\mathbf{R}_s$ denote the covariance matrix of $\mathbf{s}_i$ given in (13) and $\{\delta_1, \delta_2, \ldots, \delta_p\}$
be the eigenvalues of the matrix $\mathbf{H}\mathbf{R}_s\mathbf{H}^{\dagger}$ (with $\delta_1 \ge \delta_2 \ge \cdots \ge \delta_p$). Then there exists a value
of $\rho$, with $\delta_p \le \rho \le \delta_1$, such that the probability of detection given $\mathbf{H}$ can be approximated as
$P_{d|\mathbf{H}} \approx Q(\rho)$, where the function $Q(\cdot)$ is

Q{5) = ^ WV-(^ + ^) 2 ^ _ Fa f (^)* " + ^ 




^#^)iv-(Viv + VP) 2> l ^ -(v^ + v ^) 2 



(40) 



where $\epsilon = \frac{1}{p}\mathrm{Tr}(\mathbf{H}\mathbf{R}_s\mathbf{H}^{\dagger}) + \sigma^2$ and $\pi_1$, $\pi_2$ (with $\pi_1 < \pi_2$) denote the two roots of the function
(46) for AIC or (49) for MDL. Furthermore, upper and lower bounds can be obtained as
$Q(\delta_p) \le P_{d|\mathbf{H}} \le Q(\delta_1)$.

Proof: Please refer to Appendix C. ■
From Proposition 2, we find that $P_d$ is not only related to $N$ and $p$, but also depends on
the ratio of the signal strength of the primary user to the noise variance. The exact value
of $\rho \in [\delta_p, \delta_1]$ in Proposition 2 is difficult to determine analytically, since it is related
to both the channel response $\mathbf{H}$ and the covariance matrix of the source signal $\mathbf{R}_s$. In practice, we
can simply choose $\rho_{\mathrm{AIC}}$ and $\rho_{\mathrm{MDL}}$ as fixed multiples of $(\delta_p + \delta_1)$. It will be demonstrated later in
Section VI that the analytical $P_{d|\mathbf{H}}$ based on this choice of $\rho$ approximates the Monte Carlo
results very well in most cases.

V. Generalized Information Theoretic Criteria Sensing Algorithm

As mentioned in the previous section, the probability of detection and the probability of false
alarm of the proposed simplified ITC sensing algorithm are not directly related to each other,
as the algorithm does not involve any threshold (the same holds for the original ITC algorithm in [25]).
According to the analytical results given in (26) and (40), to satisfy different system requirements,
a proper set of values for the parameters $M$, $K$ and $N$ in model (11) should be chosen, which
is inconvenient for practical application. In this section, based on the analytical discussion in
Section IV, we propose a generalized information theoretic criteria sensing algorithm which can
provide a flexible tradeoff between $P_d$ and $P_f$ according to different system design requirements.
From the expressions given in (28) and (44), we find that the sensing decision for the SITC
algorithm is actually based on the decision variable
$\log\left[\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p} \Big/ \left(l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}\right)\right]$. Thus, we generalize
the decision rule as

$$T_{\mathrm{GITC}}(\mathcal{L}_x):\ \frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}{l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}} \underset{\mathcal{H}_0}{\overset{\mathcal{H}_1}{\gtrless}} \gamma, \tag{41}$$

where $\gamma$ is a pre-set threshold. It is seen that if we set $\gamma = \exp\left(\frac{2p-1}{N}\right)$, the decision rule
given in (41) turns into the AIC-based SITC sensing algorithm presented in Algorithm 1. If
we fix $\gamma = \exp\left(\frac{(p-0.5)\log N}{N}\right)$, the algorithm becomes the MDL-based SITC sensing algorithm.
Furthermore, it is easy to find that the analytical results obtained in Section IV are applicable
to the GITC sensing algorithm. The only change that needs to be made is to replace $\alpha_1$ and
$\alpha_2$ in (26) (or $\pi_1$ and $\pi_2$ in (40)) by the two real roots of the following function:

$$f(x) = x^{p} - p x^{p-1} + \frac{(p-1)^{p-1}}{\gamma}. \tag{42}$$
Thus, the outline of the proposed GITC sensing algorithm can be summarized as follows.

Algorithm 2: GITC sensing algorithm

Step 1 and Step 2: the same as Algorithm 1 in Section IV.

Step 3: According to the system requirement on $P_f$, choose a proper threshold $\gamma$ based on
(26) and (42).

Step 4: Conduct the decision based on (41).

According to the decision variable presented in (41), we find that the proposed GITC sensing
algorithm is actually also an eigenvalue-based method similar to [10]-[12]. The advantage of
the proposed GITC over the algorithms in [11], [12] is that it is able to analytically obtain
the explicit decision threshold $\gamma$ according to the system requirement on $P_f$ before the actual
deployment.
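The decision rule (41) can be sketched in a few lines once the eigenvalues are available. The version below works with the logarithm of the decision variable and of $\gamma$, which is equivalent to (41) because the logarithm is monotonic; the function names and example values are assumptions for illustration, and the eigenvalues are taken to be strictly positive.

```python
import math

def gitc_log_statistic(eigenvalues):
    """Logarithm of the decision variable on the left-hand side of (41)."""
    l = sorted(eigenvalues, reverse=True)
    p = len(l)
    mean_all = sum(l) / p                 # (1/p) * sum_{i=1}^{p} l_i
    mean_rest = sum(l[1:]) / (p - 1)      # (1/(p-1)) * sum_{i=2}^{p} l_i
    return p * math.log(mean_all) - math.log(l[0]) - (p - 1) * math.log(mean_rest)

def gitc_detect(eigenvalues, log_gamma):
    """Decide H1 when the statistic exceeds the threshold (both in log form)."""
    return gitc_log_statistic(eigenvalues) > log_gamma

def log_gamma_sitc(p, n_obs, criterion="aic"):
    """Thresholds that reduce (41) to the AIC- or MDL-based SITC rule."""
    if criterion == "aic":
        return (2 * p - 1) / n_obs
    return (p - 0.5) * math.log(n_obs) / n_obs

# Hypothetical eigenvalues; a larger threshold makes the detector more conservative.
decision_aic = gitc_detect([10.0, 1.0, 1.0, 1.0], log_gamma_sitc(4, 1000))
decision_strict = gitc_detect([10.0, 1.0, 1.0, 1.0], 3.0)
```

Raising $\gamma$ lowers both $P_f$ and $P_d$, which is the tradeoff that Step 3 of Algorithm 2 resolves by choosing $\gamma$ from the target $P_f$.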



VI. Simulation results and discussions 

In this section, we present some numerical examples to demonstrate the effectiveness of the 
proposed sensing schemes and to confirm the theoretical analysis. 

A. Comparison between simulation and analytical results for both SITC and OITC

In our first set of examples, we compare the simulation results with the analytical results given in
Proposition 1 and Proposition 2. The comparison between SITC and OITC is also presented.
In the simulation, the channel taps are random numbers with zero-mean complex Gaussian
distribution. All the results are averaged over 1000 Monte Carlo realizations. For each realization,
a random channel, random noise and random BPSK modulated inputs are generated. We define
the SNR as the ratio of the average received signal power to the average noise power

$$\mathrm{SNR} = \frac{E\left[\|x_i - \mu_i\|^2\right]}{E\left[\|\mu_i\|^2\right]}. \tag{43}$$

The comparison of simulation and analytical results for $P_f$ is shown in TABLE I
and TABLE II. According to Proposition 1, $P_f$ is independent of the noise variance and thus
remains constant over different SNR. Hence we average the values obtained at different SNR as
the simulated $P_f$ and compare it with the analytical $P_f$. From TABLE I, we first observe that
SITC and OITC perform almost the same. It is also seen that, for AIC, the analytical results
are slightly larger than the simulation results, especially when p = MK is small. Nevertheless,
the analytical approximation is accurate enough to evaluate the performance of the proposed
sensing scheme. It is also found that $P_{f\text{-AIC}}$ gradually decreases as p = MK increases, while
$P_{f\text{-MDL}}$ remains zero in both simulation and analytical results. We conclude that the MDL
method has excellent false alarm performance. From TABLE II, we find that the probability of
false alarm increases very slowly as N increases. In fact, our simulation shows that $P_{f\text{-AIC}}$ is
still below 0.1 even when $N = 10^{15}$ at M = 5 and K = 4.

Figs. 1-4 show $P_d$ for different system parameters. In Fig. 1, we first compare the detection
performance obtained by simulation between SITC and OITC. It is seen that the proposed SITC
sensing algorithm does not lead to any performance loss compared to the OITC algorithm. Then,
comparing the semi-analytical results obtained from Proposition 2 with the simulation results,
one can observe a very good match between them, especially for the MDL method. Thus, Proposition
2 is validated. Fig. 2, Fig. 3 and Fig. 4 present the simulation results of $P_d$ for variable K (at
M = 5, N = 10000), M (at K = 4, N = 10000) and N (at K = 4, M = 5), respectively. It is
found that the performance improves as any of these parameters increases.
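A single Monte Carlo realization of the kind described in this subsection can be sketched as below. This is a simplified illustration only: it assumes a flat model $x_i = H s_i + \mu_i$ with a $p \times q$ complex Gaussian channel, and does not reproduce the over-determined construction or the channel order used in the paper. The SNR is assumed to be specified in dB, and the names `one_realization` and `empirical_snr_db` are hypothetical.

```python
import numpy as np

def one_realization(p, q, N, snr_db, rng):
    """Draw a random channel, BPSK inputs and noise scaled to a target SNR (dB)."""
    H = (rng.standard_normal((p, q)) + 1j * rng.standard_normal((p, q))) / np.sqrt(2)
    s = rng.choice([-1.0, 1.0], size=(q, N))                 # random BPSK symbols
    signal = H @ s
    # Average received signal power per vector, i.e. the numerator of the SNR.
    sig_pow = np.mean(np.sum(np.abs(signal) ** 2, axis=0))
    # Per-entry noise variance chosen so that E||mu_i||^2 = sig_pow / SNR.
    noise_var = sig_pow / (p * 10 ** (snr_db / 10))
    mu = np.sqrt(noise_var / 2) * (rng.standard_normal((p, N))
                                   + 1j * rng.standard_normal((p, N)))
    return signal + mu, mu

def empirical_snr_db(x, mu):
    """Sample version of the SNR definition: E||x_i - mu_i||^2 / E||mu_i||^2."""
    num = np.mean(np.sum(np.abs(x - mu) ** 2, axis=0))
    den = np.mean(np.sum(np.abs(mu) ** 2, axis=0))
    return 10 * np.log10(num / den)
```

Averaging a detector's decisions over many such realizations (1000 in the experiments above) gives the simulated probabilities.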

B. Comparison between SITC and other sensing algorithms

Thus far, a few efficient sensing algorithms have been proposed in the literature, each
requiring distinct prior information. In this subsection, for fair comparison, we only choose
the eigenvalue-based methods proposed in [10]-[12] and the energy detection method since
they both need little prior information. It should be mentioned that the proposed SITC-AIC
and SITC-MDL algorithms are equivalent to the GITC algorithm provided in Section V via
setting $\gamma_{\mathrm{AIC}} = \exp\left(\frac{2p-1}{N}\right)$ and
$\gamma_{\mathrm{MDL}} = \exp\left(\frac{(2p-1)\log N}{2N}\right)$. Therefore, we omit the performance
comparison with the GITC. In the simulation, we fix the channel order at L = 10 as in [10]
and choose N = 10000, K = 4 and M = 5. Fig. 5 shows the comparison with the energy
detection (ED) method (with perfect estimation of noise covariance) and the four eigenvalue-based
methods, namely, the maximum minimum eigenvalue detection (EV-MME) and energy
with minimum eigenvalue detection (EV-EME) [10], the blindly combined energy detection
(EV-BCED) [11], and the arithmetic to geometric mean (EV-AGM) [12]. We see that, under almost
the same $P_f$, energy detection performs the best, followed by the EV-AGM method, the proposed
SITC-AIC method and the EV-BCED method, and then the EV-MME and EV-EME methods. Among
the proposed SITC-AIC and the four eigenvalue-based methods, SITC-AIC achieves almost the
same performance as the EV-BCED method, and both outperform EV-MME and EV-EME
while being slightly inferior to the EV-AGM method. Though the proposed SITC-MDL
method performs the worst in $P_d$, it is the best among all the considered schemes in terms of
$P_f$ performance.
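For orientation, three of the eigenvalue-based statistics named above can be sketched from the eigenvalues of the sample covariance matrix. The forms below are the commonly stated ones suggested by the method names (maximum to minimum eigenvalue, average energy to minimum eigenvalue, arithmetic to geometric mean). They are assumptions rather than quotations from [10], [12]. Decision thresholds and the EV-BCED statistic are deliberately left out.

```python
import numpy as np

def eigen_statistics(eigs):
    """Assumed forms of the EV-MME, EV-EME and EV-AGM test statistics."""
    l = np.sort(np.asarray(eigs, dtype=float))[::-1]   # descending order
    return {
        "MME": l[0] / l[-1],                            # max / min eigenvalue
        "EME": l.mean() / l[-1],                        # average energy / min eigenvalue
        "AGM": l.mean() / np.exp(np.mean(np.log(l))),   # arithmetic / geometric mean
    }
```

All three equal 1 for a perfectly white covariance matrix and increase as a signal component makes the eigenvalue spectrum uneven.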

The comparison with energy detection under noise uncertainty is presented in Fig. 6, where
"ED-x dB" means that the noise uncertainty in energy detection is x dB as defined in [10]. It is
observed that, although the proposed method performs worse than the energy detection method
with accurate noise covariance estimation, it significantly outperforms energy detection in both
$P_d$ and $P_f$ when there exists some noise uncertainty. This clearly demonstrates the robustness
of the information theoretic criteria based blind sensing algorithm.

C. Performance of the GITC algorithm 

Results for the GITC sensing algorithm at different threshold values are demonstrated in
Fig. 7. It is assumed that we should choose proper thresholds to make $P_f$ = 0.1, $P_f$ = 0.05 and
$P_f$ = 0.01. According to Proposition 1 and the discussion in Section V, we choose three
thresholds $\gamma$ = 1.0372, $\gamma$ = 1.0393 and $\gamma$ = 1.0429 (note that since the analytical results are
slightly larger than the simulation results, the thresholds we choose should make the theoretical $P_f$
larger than the required $P_f$ by about 0.02). From the plots, it is found that the $P_f$ requirements
are satisfied very well. One can also see that the probability of false alarm is very sensitive to
the threshold. Hence, the GITC sensing algorithm is flexible for system design with different
requirements.
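As a quick arithmetic check on where these thresholds sit, the sketch below evaluates the SITC-equivalent thresholds for the setting of Fig. 7 (M = 5, K = 4, N = 1000, hence p = MK = 20). It assumes the forms $\gamma_{\mathrm{AIC}} = \exp((2p-1)/N)$ and $\gamma_{\mathrm{MDL}} = \exp((2p-1)\log N/(2N))$, which follow from the standard AIC and MDL penalty terms.

```python
import math

p, N = 20, 1000                     # p = M*K with M = 5, K = 4; N as in Fig. 7

# Assumed SITC-equivalent thresholds (from the standard AIC/MDL penalties).
gamma_aic = math.exp((2 * p - 1) / N)
gamma_mdl = math.exp((2 * p - 1) * math.log(N) / (2 * N))

# Thresholds quoted in the text, keyed by the target false alarm probability.
chosen = {0.10: 1.0372, 0.05: 1.0393, 0.01: 1.0429}

print(f"gamma_AIC = {gamma_aic:.4f}, gamma_MDL = {gamma_mdl:.4f}")
for target_pf, g in chosen.items():
    print(f"target Pf = {target_pf:.2f} -> gamma = {g}")
```

Under these assumptions, $\gamma_{\mathrm{AIC}}$ falls between the thresholds chosen for $P_f$ = 0.05 and $P_f$ = 0.01. This is consistent with the simulated SITC-AIC false alarm probability of 0.0421 at N = 1000 in TABLE II. $\gamma_{\mathrm{MDL}}$ is much larger than all three, in line with the near-zero false alarm probability observed for MDL.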



VII. Conclusions

In this paper we have provided an intensive study on the information theoretic criteria based
blind spectrum sensing method. Based on the prior work on the related study, we first proposed
the simplified ITC sensing algorithm. This algorithm significantly reduces the computational
complexity without losing any detection performance compared with the existing ITC based
sensing algorithm. Moreover, it enables a more tractable analytical study of the detection
performance. Thereafter, applying the recent advances in random matrix theory, we derived
closed-form expressions for both the probability of false alarm and the probability of detection
which can tightly approximate the actual results in simulation. We further generalized the SITC
sensing algorithm to an eigenvalue-based sensing algorithm which strikes a balance between the
probabilities of detection and false alarm by involving an adjustable threshold. Simulation results
demonstrate that the proposed blind sensing algorithm outperforms the existing eigenvalue-based
sensing algorithms in certain scenarios.

Appendix A
Whitening the over-sampled noises

At the secondary receiver, the received continuous signal is usually filtered by a low-pass
filter. Therefore, the noise in the over-sampled received signal should be correlated. We assume
that the white noise before the filter is $\tilde{\mu}(t)$ and the system function of the low-pass
filter is g(t), which is known at the secondary receiver. In the following, we only consider the
real-valued case, since in the communication system a complex-valued signal is just the
combination of two orthogonal real-valued signals. As is well known, $\mu(t)$ can be described by
$\tilde{\mu}(t)$ and g(t) as

$$\mu(t) = g(t) \otimes \tilde{\mu}(t) = \int_{0}^{t_{\max}} g(\tau)\,\tilde{\mu}(t-\tau)\,d\tau,$$

where $(0, t_{\max})$ represents the time span of g(t) and $\otimes$ denotes the convolution operator. Thus,
the auto-correlation function of $\mu(t)$, denoted by $\phi_{\mu}(\tau)$, can be expressed as

$$\phi_{\mu}(\tau) = \phi_{g}(\tau) \otimes \phi_{\tilde{\mu}}(\tau),$$

where $\phi_{g}(\tau)$ and $\phi_{\tilde{\mu}}(\tau)$ are the auto-correlation functions of g(t) and
$\tilde{\mu}(t)$, respectively. Note that $\phi_{\tilde{\mu}}(\tau)$ should be equal to $\sigma^2\delta(\tau)$
since $\tilde{\mu}(t)$ is white (here the variance of $\tilde{\mu}(t)$ is assumed to be $\sigma^2$).
Therefore, we derive that

$$\phi_{\mu}(\tau) = \sigma^2\phi_{g}(\tau) = \sigma^2\int_{0}^{t_{\max}} g(\xi)\,g(\tau-\xi)\,d\xi, \qquad 0 < \tau < 2t_{\max}.$$

Thus, if the received signal is over-sampled at rate $Kf_s$, where $f_s$ is the reciprocal of the baseband
symbol duration T and K is the over-sampling factor, the covariance matrix of the noise vector
$\mu_i$ given in (11) becomes

$$R_{\mu} = \sigma^2 Q,$$

with Q having entries $q_{i,j} = \phi_{g}(|i-j|\,T/K)$. Note that Q is a positive definite symmetric matrix. It
can be decomposed into $Q = \tilde{Q}^2$, where $\tilde{Q}$ is also a positive definite symmetric matrix. Hence,
to obtain independent noise samples in the over-sampling scheme, we can pre-whiten the
over-sampled noise samples $\mu_i$ as

$$\tilde{\mu}_i = \tilde{Q}^{-1}\mu_i.$$

Then, the covariance matrix of $\tilde{\mu}_i$ becomes

$$R_{\tilde{\mu}} = \tilde{Q}^{-1} R_{\mu}\,\tilde{Q}^{-1} = \sigma^2 I.$$

Now, the noise samples $\tilde{\mu}_i$ are whitened. It is noted that Q depends only on the low-pass filter and
the over-sampling factor K and is independent of the signal and noise. Therefore, the pre-whitening
process can be applied blindly.
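The pre-whitening step can be illustrated numerically as follows. The sketch assumes a toy correlation matrix Q with entries $0.6^{|i-j|}$ as a stand-in for the filter autocorrelation sampled at the over-sampling lags. The real Q would be computed from g(t). The helper name `inv_sqrt_spd` is hypothetical.

```python
import numpy as np

def inv_sqrt_spd(Q):
    """Inverse of the symmetric positive definite square root of Q."""
    w, V = np.linalg.eigh(Q)            # Q = V diag(w) V^T with w > 0
    return (V / np.sqrt(w)) @ V.T       # V diag(w**-0.5) V^T

p, sigma2 = 6, 2.0
lags = np.abs(np.subtract.outer(np.arange(p), np.arange(p)))
Q = 0.6 ** lags                         # toy positive definite symmetric Q (assumption)

W = inv_sqrt_spd(Q)                     # plays the role of the inverse square root of Q
R_mu = sigma2 * Q                       # covariance of the correlated noise vector
R_white = W @ R_mu @ W.T                # covariance after pre-whitening
```

Because the whitening matrix depends only on Q, it can be computed once offline and applied to every received vector without knowledge of the signal or of $\sigma^2$.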

Appendix B 
Proof of Lemma 1 

We prove the lemma from two aspects. Firstly, it has been shown in [32], [33] that most of
the estimation errors of AIC and MDL occur tightly around the true number. According to
this finding, at hypothesis $\mathcal{H}_0$ (the true number of source signals is zero), if there exists
$\hat{k} > 0$ minimizing the AIC or MDL cost function, then we have $\hat{k} = 1$ with high probability.
Hence, Lemma 1 holds for the case of false alarm. Next, we prove that Lemma 1 holds at hypothesis
$\mathcal{H}_1$. Since the primary user is present, the eigenvalues $l_i$ of the sampled covariance matrix
are distinct at least for i = 1, 2, ..., q (here q is the true source number). For i = q, q + 1, ..., p,
the eigenvalues are actually estimates of the noise variance $\sigma^2$. They may be equal to each other
when N is large enough. According to the expressions of AIC and MDL, the second terms of both
criteria are monotonically increasing functions of k. For either cost function to attain its
minimum at $\hat{k} \in [1, p-1]$, the first terms of both criteria must be monotonically
decreasing for k = 0, 1, ..., $\hat{k}$. We next prove this statement.

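We focus on the AIC criterion; the extension to MDL is straightforward. Supposing
$k' \in [2, \hat{k}]$ and

$$f_{\mathrm{AIC}}(k) = -2\log\left(\frac{\prod_{i=k+1}^{p} l_i^{1/(p-k)}}{\frac{1}{p-k}\sum_{i=k+1}^{p} l_i}\right)^{(p-k)N},$$

we have

$$f_{\mathrm{AIC}}(k'-1) - f_{\mathrm{AIC}}(k') = 2N\log\frac{\left(\frac{1}{p-k'+1}\sum_{i=k'}^{p} l_i\right)^{p-k'+1}}{l_{k'}\left(\frac{1}{p-k'}\sum_{i=k'+1}^{p} l_i\right)^{p-k'}}.$$

Since

$$\frac{1}{p-k'+1}\sum_{i=k'}^{p} l_i = \frac{1}{p-k'+1}\,l_{k'} + \frac{p-k'}{p-k'+1}\cdot\frac{1}{p-k'}\sum_{i=k'+1}^{p} l_i \ge l_{k'}^{\frac{1}{p-k'+1}}\left(\frac{1}{p-k'}\sum_{i=k'+1}^{p} l_i\right)^{\frac{p-k'}{p-k'+1}}$$

(here, the arithmetic-mean geometric-mean inequality $\alpha_1 x_1 + \alpha_2 x_2 \ge x_1^{\alpha_1}x_2^{\alpha_2}$
with $\alpha_1 + \alpha_2 = 1$ is applied, with strict inequality whenever $l_{k'}$ differs from the mean of the
remaining eigenvalues), we conclude that

$$\frac{\left(\frac{1}{p-k'+1}\sum_{i=k'}^{p} l_i\right)^{p-k'+1}}{l_{k'}\left(\frac{1}{p-k'}\sum_{i=k'+1}^{p} l_i\right)^{p-k'}} > 1.$$

It further means

$$f_{\mathrm{AIC}}(k'-1) - f_{\mathrm{AIC}}(k') > 0,$$

i.e., $f_{\mathrm{AIC}}(k)$ is a monotonically decreasing function. Hence, as $N \to \infty$, we have

$$\mathrm{AIC}(0) - \mathrm{AIC}(1) = 2N\log\frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}{l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}} - 4p + 2 > 0.$$

If N is finite but large enough, we claim that Lemma 1 holds with high probability. The high
probability is also supported by the fact that, due to the property of the SVD
technique, the first eigenvalue $l_1$ is always much larger than the other eigenvalues. Therefore,
$2N\log\frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}{l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}}$
is large enough to make Lemma 1 hold at hypothesis $\mathcal{H}_1$. Thus,
we complete the proof of Lemma 1.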
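The monotonicity used in this proof can be checked numerically. The sketch below assumes that the first term of AIC has the form $f_{\mathrm{AIC}}(k) = -2N(p-k)\,[\text{mean of } \log l_i - \log(\text{mean of } l_i)]$ over $i = k+1, \ldots, p$, which is the log of the geometric-to-arithmetic mean ratio raised to the power $(p-k)N$. The eigenvalues are arbitrary positive test values, not simulation output.

```python
import numpy as np

def f_aic(l, k, N):
    """First (log-likelihood) term of AIC for model order k, assumed form."""
    tail = np.asarray(l, dtype=float)[k:]            # l_{k+1}, ..., l_p
    # -2 * log( (geometric mean / arithmetic mean) ** ((p - k) * N) )
    return -2.0 * N * len(tail) * (np.mean(np.log(tail)) - np.log(np.mean(tail)))

rng = np.random.default_rng(1)
l = np.sort(rng.uniform(0.5, 5.0, size=8))[::-1]     # arbitrary positive eigenvalues
vals = [f_aic(l, k, N=1000) for k in range(len(l))]  # k = 0, ..., p-1
diffs = np.diff(vals)                                # f(k) - f(k-1) for k = 1, ..., p-1
```

Every entry of `diffs` is non-positive, as the weighted arithmetic-mean geometric-mean inequality predicts, and the last value `vals[-1]` is zero because a single eigenvalue has equal arithmetic and geometric means.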



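Appendix C
Proof of Proposition 2

We first derive the probability of misdetection $P_m$ (the probability of
missing the presence of the primary user at hypothesis $\mathcal{H}_1$), then obtain the probability of
detection $P_d$ through $1 - P_m$. Without loss of generality, the following derivation is also based
on AIC. According to (38), we have

$$P_{m\text{-AIC}|\mathcal{H}_1} = \Pr[\mathrm{AIC}(0) - \mathrm{AIC}(1) < 0 \mid \mathcal{H}_1].$$

Similar to the process described in the proof of Proposition 1, we can rewrite $P_{m\text{-AIC}|\mathcal{H}_1}$ as

$$P_{m\text{-AIC}|\mathcal{H}_1} = \Pr\left[\frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}{l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}} < \exp\left(\frac{2p-1}{N}\right) \,\Big|\, \mathcal{H}_1\right], \tag{44}$$

where $\{l_1, l_2, \ldots, l_p\}$ are the decreasingly ordered eigenvalues of the sampled covariance matrix
$R_x$ in (39). When the number of observations N is large enough, we obtain the approximation

$$\sum_{i=1}^{p} l_i \approx \mathrm{Tr}(H R_s H^{\dagger}) + p\sigma^2.$$

Thus

$$\sum_{i=2}^{p} l_i = \sum_{i=1}^{p} l_i - l_1 \approx \mathrm{Tr}(H R_s H^{\dagger}) + p\sigma^2 - l_1.$$

Hence, (44) turns to

$$P_{m\text{-AIC}|\mathcal{H}_1} \approx \Pr\left[y^{p} - p\,y^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)} < 0 \,\Big|\, \mathcal{H}_1\right], \tag{45}$$
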
where $e = \frac{1}{p}\mathrm{Tr}(H R_s H^{\dagger}) + \sigma^2$ and $y = p - \frac{l_1}{e}$.

Assuming $\pi_1$ and $\pi_2$ (with $\pi_1 < \pi_2$) are the two real roots within (0, p) of the following function

$$g(y) = y^{p} - p\,y^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{2p-1}{N}\right)},$$

as described in the proof of Proposition 1, the probability of misdetection is concluded as

$$P_{m\text{-AIC}|\mathcal{H}_1} \approx \Pr[\pi_1 < y < \pi_2 \mid \mathcal{H}_1], \tag{46}$$

i.e.,

$$P_{m\text{-AIC}|\mathcal{H}_1} \approx \Pr[(p - \pi_2)e < l_1 < (p - \pi_1)e \mid \mathcal{H}_1]. \tag{47}$$
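The change of variable behind (45)-(47) can be checked in a few lines. The derivation below is a sketch that assumes the standard AIC cost with penalty term $2k(2p-k)$, so that $\mathrm{AIC}(0) - \mathrm{AIC}(1)$ carries the constant $-2(2p-1)$, together with the large-N approximation $\frac{1}{p}\sum_{i=1}^{p} l_i \approx e$.

```latex
% With l_1 = (p - y) e and \sum_{i=2}^{p} l_i \approx p e - l_1 = y e:
\frac{\left(\frac{1}{p}\sum_{i=1}^{p} l_i\right)^{p}}
     {l_1\left(\frac{1}{p-1}\sum_{i=2}^{p} l_i\right)^{p-1}}
\approx \frac{e^{p}}{(p-y)\,e\,\left(\frac{y e}{p-1}\right)^{p-1}}
= \frac{(p-1)^{p-1}}{(p-y)\,y^{p-1}} .

% Misdetection event (note (p - y) y^{p-1} > 0 for 0 < y < p):
\mathrm{AIC}(0) - \mathrm{AIC}(1) < 0
\;\Longleftrightarrow\;
\frac{(p-1)^{p-1}}{(p-y)\,y^{p-1}} < \exp\!\left(\frac{2p-1}{N}\right)
\;\Longleftrightarrow\;
y^{p} - p\,y^{p-1} + \frac{(p-1)^{p-1}}{\exp\!\left(\frac{2p-1}{N}\right)} < 0 .

% g(y) is negative exactly between its two roots, and y = p - l_1/e, hence
\pi_1 < y < \pi_2
\;\Longleftrightarrow\;
(p-\pi_2)\,e < l_1 < (p-\pi_1)\,e .
```

The function $g(y)$ is positive at $y = 0$ and $y = p$ and attains its minimum at $y = p - 1$, where it is negative because $\exp\left(\frac{2p-1}{N}\right) > 1$. This is why exactly two real roots lie in (0, p).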



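Note that $l_1$ is the largest eigenvalue of the sampled covariance matrix $R_x$. Given the channel
matrix, $R_x$ can be approximated as

$$R_x = \frac{1}{N}\sum_{i=1}^{N} x_i x_i^{\dagger} \approx H R_s H^{\dagger} + \frac{1}{N}\sum_{i=1}^{N}\mu_i\mu_i^{\dagger}$$

when N is large enough.

Let $\{\delta_1, \delta_2, \ldots, \delta_p\}$ and $\{\chi_1, \chi_2, \ldots, \chi_p\}$ be the decreasingly ordered eigenvalues of
$H R_s H^{\dagger}$ and $\frac{1}{N}\sum_{i=1}^{N}\mu_i\mu_i^{\dagger}$, respectively. Applying Weyl's inequality
theorem in [34], the largest eigenvalue of $R_x$, $l_1$, satisfies

$$\chi_1 + \delta_p \le l_1 \le \chi_1 + \delta_1.$$

Equivalently, $\chi_1$ satisfies

$$l_1 - \delta_1 \le \chi_1 \le l_1 - \delta_p. \tag{48}$$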
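The bound from Weyl's inequality can be verified numerically on a small example. The sketch assumes $R_s = I$, a random complex Gaussian channel and unit-variance white noise. The function name `weyl_bounds` and the dimensions are illustrative only.

```python
import numpy as np

def weyl_bounds(p=6, q=2, N=500, seed=2):
    """Return (lower, l1, upper) for the largest eigenvalue of A + B."""
    rng = np.random.default_rng(seed)
    H = rng.standard_normal((p, q)) + 1j * rng.standard_normal((p, q))
    A = H @ H.conj().T                               # stands in for H R_s H^dagger
    mu = (rng.standard_normal((p, N))
          + 1j * rng.standard_normal((p, N))) / np.sqrt(2)
    B = mu @ mu.conj().T / N                         # noise sample covariance
    delta = np.linalg.eigvalsh(A)[::-1]              # delta_1 >= ... >= delta_p
    chi = np.linalg.eigvalsh(B)[::-1]                # chi_1 >= ... >= chi_p
    l1 = np.linalg.eigvalsh(A + B)[-1]               # largest eigenvalue of the sum
    return chi[0] + delta[-1], l1, chi[0] + delta[0]

lower, l1, upper = weyl_bounds()
```

Since the signal part has rank q < p in this example, $\delta_p$ is numerically zero and the lower bound reduces to $\chi_1$.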



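Therefore, there must exist a constant $\rho$ satisfying $\delta_p \le \rho \le \delta_1$ which makes $l_1 - \rho$ equal to $\chi_1$.
Then (47) is rewritten as

$$P_{m\text{-AIC}|\mathcal{H}_1} \approx \Pr[(p - \pi_2)e - \rho < \chi_1 < (p - \pi_1)e - \rho \mid \mathcal{H}_1],$$

i.e.,

$$P_{d\text{-AIC}|\mathcal{H}_1} \approx \Pr\left[\frac{(p-\pi_1)e-\rho}{\sigma^2} < \frac{\chi_1}{\sigma^2} < p \,\Big|\, \mathcal{H}_1\right] + \Pr\left[0 < \frac{\chi_1}{\sigma^2} < \frac{(p-\pi_2)e-\rho}{\sigma^2} \,\Big|\, \mathcal{H}_1\right],$$

where we use a constraint on $\chi_1$ similar to that in the proof of Proposition 1. Since $\chi_1$ converges
to the Tracy-Widom distribution of order two, we conclude

$$P_{d\text{-AIC}|\mathcal{H}_1} \approx Q(\rho),$$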
where $Q(\cdot)$ is defined in Proposition 2. Meanwhile, based on (47), the upper and lower
bounds for $P_{m\text{-AIC}|\mathcal{H}_1}$ are

$$1 - Q(\delta_1) < P_{m\text{-AIC}|\mathcal{H}_1} < 1 - Q(\delta_p).$$

Therefore, the upper and lower bounds of $P_{d\text{-AIC}|\mathcal{H}_1}$ can be obtained straightforwardly as

$$Q(\delta_p) < P_{d\text{-AIC}|\mathcal{H}_1} < Q(\delta_1).$$

The proof for the MDL criterion is the same, except that the function g(y) in (46) is redefined as

$$g(y) = y^{p} - p\,y^{p-1} + \frac{(p-1)^{p-1}}{\exp\left(\frac{(2p-1)\log N}{2N}\right)}.$$

Proposition 2 is thus proved.

TABLE I

Probability of false alarm with different p = M x K at N = 10000

p = M x K                          6 = 2x3   12 = 3x4   20 = 5x4   24 = 4x6   35 = 5x7
Simulation results for SITC-AIC    0.0948    0.0770     0.0594     0.0541     0.0460
Simulation results for OITC-AIC    0.0972    0.0773     0.0597     0.0541     0.0470
Analytical results for SITC-AIC    0.1360    0.1036     0.0791     0.0711     0.0550
Simulation results for SITC-MDL
Simulation results for OITC-MDL
Analytical results for SITC-MDL



TABLE II

Probability of false alarm with different N at p = MK = 20

N                                  N = 1000   N = 5000   N = 10000
Simulation results for SITC-AIC    0.0421     0.0558     0.0594
Simulation results for OITC-AIC    0.0421     0.0561     0.0597
Analytical results for SITC-AIC    0.0581     0.0744     0.0791
Simulation results for SITC-MDL
Simulation results for OITC-MDL
Analytical results for SITC-MDL

Fig. 1. Simulation and theoretical results for the probability of detection at different (M, K, N) for both SITC and OITC.
Fig. 2. Probability of detection for different K at M = 5 and N = 10000.
Fig. 3. Probability of detection for different M at K = 4 and N = 10000.
Fig. 4. Probability of detection for different N at M = 5 and K = 4.
Fig. 5. Comparison with the eigenvalue-based methods and the energy detection method at M = 5, K = 4 and N = 10000.
Fig. 6. Comparison with energy detection with noise uncertainty at M = 5, K = 4 and N = 10000.
Fig. 7. Simulation results for the GITC algorithm for different $P_f$ at M = 5, K = 4 and N = 1000.



